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[0001} LOW COMPLEXITY DATA DETECTION USING 

FAST FOURIER TRANSFORM OF CHANNEL CORRELATION MATRIX 

[0002] BACKGROUND 

[0003] The invention generally relates to wireless communication systems. In 
particular, the invention relates to data detection in a wireless communication system. 
[0004] Figure 1 is an illustration of a wireless communication system 10. The 
communication system 10 has base stations 12 x to 12 5 which communicate with user 
equipments (UEs) 14 X to 14 3 . Each base station has an associated operational area, 
where it communicates with UEs 14^ to 14 3 in its operational area. 
[0005] In some communication systems, such as code division multiple access 
(CDMA) and time division duplex using code division multiple access 
(TDD/CDMA), multiple communications are sent over the same frequency spectrum. 
These communications are differentiated by their channelization codes. To more 
efficiently use the frequency spectrum, TDD/CDMA communication systems use 
repeating frames divided into time slots for communication. A communication sent 
in such a system will have one or multiple associated codes and time slots assigned to 
it. The use of one code in one time slot is referred to as a resource unit. 
[0006] Since multiple communications may be sent in the same frequency 
spectrum and at the same time, a receiver in such a system must distinguish between 
the multiple communications. One approach to detecting such signals is multiuser 
detection. In multiuser detection, signals associated with all the UEs 14 X to 14 3i users, 
are detected simultaneously. Approaches for implementing multiuser detection 
include block linear equalization based joint detection (BLE-JD) using a Cholesky or 
an approximate Cholesky decomposition. 

[0007] Another approach is single user detection. In single user detection, data 
is only recovered for a single user (one UE 14j). Based on the application, the single 
user detected data may have been sent using one or multiple codes. Approaches for 
implementing single user detection include block linear equalization using a Cholesky 
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or an approximate Cholesky decomposition. These approaches have a high 
complexity. The high complexity leads to increased power consumption, which at the 
UE 14j results in reduced battery life. Accordingly, it is desirable to have alternate 
approaches to detecting received data. 

[0008] SUMMARY 

[0009] A combined signal is received over a shared spectrum in a time slot in 
a time division duplex communication system using code division multiple access. 
Each data signal experiences a similar channel response. The similar channel response 
is estimated. A matrix representing a channel of the data signals based on in part the 
estimated channel response is constructed. A spread data vector is determined based 
on in part a fast fourier transform (FFT) decomposition of a circulant version of the 
channel matrix. The spread data vector is despread to recover data from the received 
combined signal. 



[0010] BRIEF DESCRIPTION OF THE DRAWING(S) 

[001 1] Figure 1 is a wireless communication system. 

[0012] Figure 2 is a simplified transmitter and a single user detection receiver. 

[0013] Figure 3 is an illustration of a communication burst. 

[0014] Figure 4 is a flowchart of low complexity data detection. 

[0015] Figures 5-15 are graphs of the performance of low complexity data 

detection. 



[0016] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 
[0017] Figure 2 illustrates a simplified transmitter 26 and receiver 28 using low 
complexity data detection in a TDD/CDMA communication system. In a typical 
system, a transmitter 26 is in each UE 14j to 14 3 and multiple transmitting circuits 26 
sending multiple communications are in each base station 12j to 12 5 . The low 
complexity data detector receiver 28 may be at a base station 12 v UEs 14 3 to 14 3 or 
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both. The receiver 28 can be used at a UE 14 1 for either multiuser or single user 
detection of a medium to high data rate service, such as a 2 megabits per second 
(Mbs). The receiver 28 can also be used at a base station 12 v when only a single UE 
14 1 transmits in a time slot 

[001 8] The transmitter 26 sends data over a wireless radio channel 30. A data 
generator 32 in the transmitter 26 generates data to be communicated to the receiver 
28. A modulation/spreading sequence insertion device 34 spreads the data and makes 
the spread reference data time-multiplexed with a midamble training sequence in the 
appropriate assigned time slot and codes for spreading the data, producing a 
communication burst or bursts. 

[001 9] A typical communication burst 1 6 has a midamble 20, a guard period 1 8 
and two data bursts 22, 24, as shown in Figure 3. The midamble 20 separates the two 
data bursts 22, 24 and the guard period 18 separates the communication bursts to 
allow for the difference in arrival times of bursts transmitted from different 
transmitters 26, The two data bursts 22, 24 contain the communication burst's data. 
[0020] The communication burst(s) are modulated by a modulator 36 to radio 
frequency (RF). An antenna 38 radiates the RF signal through the wireless radio 
channel 30 to an antenna 40 of the receiver 28. The type of modulation used for the 
transmitted communication can be any of those known to those skilled in the art, such 
as quadrature phase shift keying (QPSK) or an N-ary quadrature amplitude modulation 
(QAM). 

[0021] The antenna 40 of the receiver 28 receives various radio frequency 
signals. The received signals are demodulated by a demodulator 42 to produce a 
baseband signal. The baseband signal is processed, such as by a channel estimation 
device 44 and a low complexity data detection device 46, in the time slot and with the 
appropriate codes assigned to the received bursts. The channel estimation device 44 
uses the midamble training sequence component in the baseband signal to provide 
channel information, such as channel impulse responses. The channel information is 
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used by the data detection device 46 to estimate the transmitted data of the received 
communication bursts as hard symbols. 

[0022] The data detection device 46 uses the channel information provided by 
the channel estimation device 44 and the known spreading codes used by the 
transmitter 26 to estimate the data of the desired received communication burst(s). 
Low complexity data detection is explained in conjunction with the flowchart of 
Figure 4. Although low complexity data detection is explained using the third 
generation partnership project (3GPP) universal terrestrial radio access (UTRA) TDD 
system as the underlying communication system, it is applicable to other systems. 
That system is a direct sequence wideband CDMA (W-CDMA) system, where the 
uplink and downlink transmissions are confined to mutually exclusive time slots. 
[0023] The receiver 28 receives using its antenna 40 a total of K bursts that 
arrive simultaneously, 48. The K bursts are superimposed on top of each other in one 
observation interval. Some or all of the K bursts may arise from or go to the same 
users for higher data rate services. For the 3GPP UTRA TDD system, each data field 
of a time slot corresponds to one observation interval. 

[0024] A k h burst of the K bursts uses a code of of length Q chips to 
spread each of its N s symbols to yield a sequence of length Q - N s chips. The k th burst 
passes through a channel with a known or estimated channel response, h {k ) , of length 
W chips to form a chip sequence of length, N c = (SF • N 5 + W - 1) . SF is the 
spreading factor. Since uplink signals may originate from multiple UEs 14j to 14 3> 
each h {k) in the uplink may be distinct. For the downlink in the absence of transmit 

diversity, all bursts pass through the same channel and have the same h (k) . At the 

receiver 28, the bursts from all users arrive superimposed as a single received vector 
, r , Some or all of the K bursts may be part of a multi-code transmission. The 

multi-codes have the same h {k) , because they originate from the same transmitter 26. 



WO 02/063779 



PCT/US02/02400 



-5- 



[0025] 



The multi-user signal model consists of N c known received chips and 



K ■ N s unknown information bearing symbols. The symbol response, s { ) , of the 

l<f h burst is the convolution of C_ (k) with h {k) . Accordingly, i ik) is of lengtli (SF+W-l) 
cliips. Wis the impulse response, which represents the trail of chips left by a unity 
symbol. The N s unknown symbols of the Jd h burst form a column vector d_ (k ] . r {k } is 

the contribution of the kf h burst to the overall received chip vector, r . d} k) is the data 

vector for the k? h burst. d} k ) and r (k ) are related by Equation 1 . 

= A {k) d ik) , wherelc= 1...K Equation 1 

[0026] A w is the channel response matrix for the K h burst, which is an N c x 
matrix whose j ih column is the symbol-response of die element of d_ {k) . Assuming a 

time-invariant symbol-response, each column of A <k) has the same support, s ( k ) , and 

successive columns are zero-padded and shifted versions of the first column. The 
overall, chip-rate, received vector is per Equation 2. 



n is a zero-mean noise vector with independent identical distribution (Lid.) 

components of the Variance, cr 2 . Equation 2 becomes Equation 3, when written as 
a single matrix equation. 




Equation 2 



Equation 3 
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[0027] 



A is the overall channel response matrix, which is a matrix of size 



N c xK -N s . ± is the data vector, which is a column vector of length K-N s . 
Equation 2 and Equation 3 model the inter-symbol interference (ISI) and multiple- 
access interference (MAI) in the received vector, r . 



sampling, such as 3.84 Mega chips per second (Mcps) in 3GPP UTRA system. For 
increased statistical accuracy, a receiver 28 may use over-sampling, such as a multiple 
chip rate sampling. A typical multiple chip rate sampling is twice the chip rate, 
although other multiples may be used. When using multiple chip rate sampling, the 
received signal burst will be over-sampled generating multiple sampled sequences. 
Each sequence is sampled at the chip rate with different time offsets with respect to 
one another. The kf h burst passes through a channel with a known or estimated 

channel response, , for the m th sampled sequence, is the contribution of the 
K h burst to the m tb overall sampled chip vector, r, H . The data symbol vectors d} k) and 
the m fh sampled chip vector r*f ) are related by Equation 4. 



Aj n k) is the symbol response matrix for the m ,u sequence, It is a matrix of size 
N c x N 5 , whose /* column is the m lh sampled symbol-response of the/ 1 element of 



[0028] 



The signal models of Equations 1, 2 and 3 are formulated for chip rate 



Equation 4 




[0029] 



Equation 5 is the overall, chip-rate, received vector, r,„ , of the rri 



Ah 



sampled sequence. 
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U = S -» * * - ro = 1. . . M Equation 5 



[0030] For an M multiple of chip rate sampling, a single matrix expression is 
per Equation 6. 



r =A'd +rc 



Equation 6 



[0031] r is the received signal vector and is defined as per Equation 7. 



r = 



L2 



LM 



Equation 7 



[0032] A ' is defined as per Equation 8. 



A' = 



A, 



Equation 8 



[0033] Equation 9 is Equation 6 rewritten as a summation form of K bursts. 
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[0034] Equation 9 can be rewritten as Equation 10. 

r ==£ff ** } C ( * } fiL (i) +n Equation 10 



[0035] C (k) is code sequence of the ** burst. H' (k) is the channel response for 

the k th sequence, which is defined for M multiple chip rate sampling per Equation 1 1 . 



H 



Equation 11 



[0036] When all the signal bursts in a time slot arise from the same user in the 
uplink or go to the same user in the downlink, the bursts pass through the same 
propagation path and, accordingly, the same fading channel As a result, H /{k) is the 
same for all bursts ( H /{k) = H Ai) = H' c , for all k and j) and is replaced in Equation 
10 with H' c as per Equation 12. 

r =H^C {ky d w ^n Equation 12 
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[0037] Equation 13 is Equation 12 rewritten as a single matrix expression. 
L =H{Cd+n Equation 13 



[0038] C is the code matrix. For M chip rate sampling, H{ is per Equation 14. 



cl 



c2 



L H cM 



Equation 14 



[0039] For an m th chip rate sample, H cm is the channel response for the m th 
sampled sequence. Each H em , m =1... M , is determined by the channel estimation 
device 44, 50. The matrix structure of each H cm is per Equation 15, 52. 



WO 02/063779 



PCT/US02/02400 



-10- 



H rn , — 





0 






0 


K \ 
tii ,i 




0 






K, -> 

III ,£ 


K i 

IH t l 










h m 2 








It'.., 11/ "3 

m t rY —J 


; 




0 




in ,?v *- i 


IV _*J 




h m q 


0 




K\ ,W -2 




h , 


ft A 


0 






K t 2 






0 






K,2 






0 


































K t W-2 


0 






0 





Equation 15 



[0040] The overall signal model of data detection is represented as per 
Equations 16 and 17. 

L = Jf c 's+n Equation 16 

j = Cd Equation 17 
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[0041] i is the spread data chip vector. C is the code vector. One approach to 
determine s is to use a zero forcing (ZF) solution of Equation 16 as per Equation 18. 

s = (H' C H IV* r Equation 1 8 

[0042] H' C H is the hermitian of H' c . Another approach is to use a minimum 
mean square error (MMSE) solution as per Equation 19, 

£ = U' c -her 2 /)" 1 r Equation 1 9 

[0043] a 1 is the noise variance. I is the identity matrix. After solving either 

Equation 17 or 18 for s , the solution of Equation 17 is obtained by despreading, as 
represented by Equation 20, 56. 

<L — C H I Equation 20 

[0044] The following approaches to solve Equations 1 8 and 1 9 for s use a fast 

fourier transform (FFT) decomposition of either a circulant approximation of the 
channel correlation matrix, R, or the channel response matrix, IV C , 54. Using either 

matrix requires an approximation; however, using the channel response matrix, H c ' , 

also requires truncation of the last W-l rows of the matrix to make it square. 
Accordingly, to eliminate degradation due to truncation, the channel correlation 
matrix, R, is preferably used. 
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[0045] A FFT decomposition of the channel correlation matrix, R> is performed 
as follows. For a ZF approach, R is defined as per Equation 21. 

M 

R =H;"H' e =^H? m H m Equation21 
»i=i 

[0046] For a MMSE approach, R is defined as per Equation 22. 

R=H^H^ + a 2 I Equation 22 

[0047] The structure of the channel correlation matrix , R, is represented as per 
Equation 23. 
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Equation 23 



[0048] Equations 1 8 and 19 are rewritten in terms of R as per Equations 24 and 
25, respectively. 

s_ = R- x H' c H l Equation 24 

i?£ = # c '"r Equation 25 

[0049] The matrix-vector multiplication Rs can be viewed as a linear 

combination of column vectors of the channel correlation matrix, /?, weighted by the 
corresponding elements of data chip vector j , as per Equation 26. 
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Equation 26 

[0050] g t is the i" 1 column of the channel correlation matrix R. s t is the i"' 
element of spread data chip vector s . 

[0051] By modifying the structure of matrix R, an optimum circulant matrix 
approximation of channel correlation matrix, R clr , can be determined using Equation 

27. 
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[0052] The first column, g , has the full non-zero elements without any 

truncation. The circulant matrix, R cir , is defined by its first column q_ . The first 

column 4 of circulant matrix, R cir , is obtained by permuting the W h column g of 

the channel correlation matrix, R, using the permutation operator or index vector as 
defined by Equation 28. 

p =[W:N <Q,l:W -l] Equation 28 

[0053] Alternately, a circulant matrix is also defined by the W h column g of 

channel correlation matrix, R. In general, any column greater than W h column may 
be used with a proper index vector (permutation vector). 

[0054] This alternate approximate circulant channel correlation matrix, c ',, , 
relates to R cir per Equation 29. 

Kir = K ar ( : >P) • Equation 29 

[0055] The advantage with this approach is that g is used directly without 

permutation. However, the solved spread data chip vector s is required to be inverse 
permuted by the index vector J as per Equation 30. 

[0056] By permuting the firstrow in the previous approach, the need for inverse 
permuting £ is eliminated. 



p=[N -SF -W +2 : N SF,l : N SF -W -hi] Equation 30 
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[0057] Equation 31 is the FFT decomposition of matrix R cir . 

R cir =D; l A R D P Equation 31 

[0058] d p is the P-point FFT matrix and is diagonal matrix, whose 

diagonal is the FFT of the first column of matrix R cir . A R is defined as 
A R =diag(D F q). 

[0059] Using a FFT decomposition of the channel response matrix, # c '» is 
performed as follows. Matched filtering, U[ H r , is represented by Equation 32. 

Hfr'^H^r. Equation 32 

[0060] The channel response matrix that corresponds to each sampled sequence, 
H Cfn , m =1,2,..., M , are circulant matrixes. Each matrix can be decomposed into 

three FFT matrix multiplication as per Equation 33. 

H cm =D; x A Stm D P , m=z\...M Equation 33 

[0061] As a result, the decomposition of the channel response matrix is per 
Equation 34. 



Equation 34 



WO 02/063779 



PCT/US02/02400 



-17- 



[0062] To recover the data chip vector $ , Equation 35 is used. 



s - R£H' t a r = D? A", 1 J) K H(m D P r. Equation 35 

/H=l 



[0063] In the frequency domain, Equation 35 becomes Equation 36. 



F(s) = Equation 36 

Fit) 



[0064] ® represents the operation of element by element multiplication. Using 
Equation 36, F(s) is determined, By taking the inverse transform of F(s) , the 

spread data vector, s , is determined. If used for multi-user detection in the downlink 

or a single user solely uses one time slot in the uplink, $ is despread by using all of 

the codes to recover the transmitted data d as soft symbols. If used for single user 

detection in the downlink, i is despread using that user's codes to recover that user's 

data as soft symbols. Hard decisions are made to convert the soft symbols to hard 
symbols. 

[0065] Two approaches to implement the FFT composition are a prime factor 
algorithm (PFA) and a radix-2 algorithm. Although a PFA is considered more 
efficient than a radix-2 algorithm when a non-power-of-two number of FFT points is 
used, the following complexity analysis is based on a radix-2 FFT implementation for 
simplicity. The complexity based on radix-2 algorithm can be considered as the worst 
case. Additional improvement in complexity is obtainable when PFA is used. Zero- 
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padding radix-2 EFT implementation entails the zero-padding the first column of H cm , 
m=l...M , the vectors r m , m = 1. . . M and £ . The zero-padding makes their length 

equal to the nearest radix-2 integer that is greater than or equal to die length of a data 
field. For example, the length of a data field is 976 chips for burst type 1 in a TDD 
burst specified by 3 GPP W-CDMA standard. The nearest radix-2 integer of 976 is 
1024 (P = 1024). P is the radix-2 integer. 

[0066] Four types of radix-2 FFT computations are required: 
DpL n > Dpllm* d pS x Two of the computations are computed M times for 

all sampled sequences: D P r m , m = 1. M and D P h m , m =1...M .The other two 
are performed only once for the sampled sequences . D p h m , m : = 1 . . . M and D P are 

computed once per time slot. D P i m , m = 1 . . . M , are computed twice per time 

slot. As a result, a total of 3(M + 1) radix-2 FFT computations are required. Each 
needs Plog 2 P complex operations. By assuming each complex operation requires four 
real operations, the complexity for radix-2 FFT computations in terms of million real 
operations per second (MROPS) is per Equation 37. 

C, =3(Af +l)/ > log 2 P«4-100 10" 6 MROPS Equation 37 

[0067] For the complexity of the vector multiplications, there are M element-to- 
element vector multiplications and one element-to-element vector division, which are 
performed twice per time slot. As a result, the complexity for the vector operations 
in terms of MROPS is per Equation 38. 



WO 02/063779 



PCT/US02/02400 



-19- 

C 2 =2(M +1)P.4-100-10^ MROPS Equation 38 

[0068] For the complexity of calculating the vector q , it requires MW 2 complex 

operations, which are performed once per time slot The complexity in terms of 
MROPS is per Equation 39. 

C 3 =MW 2 . 4400-10^ MROPS Equation 39 

[0069] The total complexity except for the despreading in MROPS is per 
Equation 40. 

C /; , = Cj + C 2 + C 3 MROPS Equation 40 

[0070] Despreading is performed twice per time slot. The complexity of 
despreading in terms of MROPS is per Equation 41. 

C desp = 2 • K • N • Q - 4 • 1 00 • 1 (T* MROPS Equation 41 

[0071] As a result, the total complexity of the data detection including 
despreading is per Equations 42 or 43. 

Crotni = Cjf, + ?desp MROPS Equation 42 

C Total " [3(W +l)/ > log 2 ^ + 2(M +1)? + MW 2 + 2£JV£).4 -100-10" 6 MROPS 

Equation 43 
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[0072] The following tables show the complexity in MROPS for a 1 024-point 
radix-2 (P = 1 024) computation. Complexity is shown in Tables 1 at the chip rate and 
at Table 2 at twice the chip rate sampling. A complexity comparison is made in 
MROPS between BLE-JD using approximate Choleslcy decomposition and low 
complexity data detection, as shown in Tables 3 and 4. Table 5 is a complexity 
comparison showing the complexity of low complexity data detection as a percentage 
of the complexity of BLE-JD using approximate Cholesky decomposition. As shown, 
low complexity data detection has a much lower complexity than approximate 
Cholesky based BLE-JD. Depending on the number of bursts transmitted and 
spreading factors, for most cases, low complexity data detection is 25% at the chip 
rate, and 30% at twice the chip rate, of the complexity of approximate Cholesky based 
BLE-JD. 



[0073] Table 1. MROPS of a full-burst using low complexity data detection for 
burst type 1 at chip rate sampling. 
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m =l...M 


P 










Via 


Radix-2 


Via Radix-2 


Via 










Radix-2 


FFT 


FFT 


Radix-2 










FFT 






FFT 






1 


1.3 


4.1 


4.1 


8.2 


8.2 


0.78 


26.7 


8 


1.3 


4.1 


4.1 


8.2 


8.2 


6.25 


32.2 


12 


1.3 


4.1 


4.1 


8.2 


8.2 


9.4 


35.3 


13 


1.3 


4.1 


4.1 


8.2 


8.2 


10.1 


36 


14 


1.3 


4.1 


4.1 


8.2 


8.2 


10.9 


36.8 


16 


1.3 


4.1 


4.1 


8.2 


8.2- 


12.5 


38.4 



[0074] Table2. MROPS of a full-burst using low complexity data detection for 
burst type 1 and twice the chip rate sampling. 
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Funcs Executed once 


per burst 


Funcs executed twice per half-burst 




#of 
bursts, 

K 


Construct 
Q 


Compute 


Compute 


Compute 


Compute 
Dp(-) 


Despread 
C j 


Total 




m=\...M 
Via 
Radix-2 
FFT 


Via 
\ Radix-2 
FFT 


m = 1...M 
Via Radix-2 
FFT 


P 

Via 
Radix-2 
FFT 








1 


2.6 


8.2 


8.2 


16.4 


16.4 


0.78 


52.6 


8 


2.6 | 


8.2 


8.2 


16.4 


16.4 


6.25 


58.1 


12 


2.6 


8.2 


8.2 


16.4 


16.4 


9.4 


61.2 


13 


2.6 


8.2 


8.2 


16.4 


16.4 


10.1 


61.9 


14 


2.6 


8.2 


8.2 


16.4 


16.4 


10.9 


62.7 


16 


2.6 


8.2 


8.2 


16.4 


16.4 


12.5 


64.3 



[0075] Table 3. Comparison in MROPS between BLE-JD (approximate 
Choleslcy decomposition) and low complexity data detection at chip rate sampling. 



Spreading Factor, Q 


# of bursts, 
K 


Proposed algorithm 


BLE-JD 


1 


1 


26.7 . 


318.2 


16 


8 


32.2 


81.1 




12 


35.3 


174.6 




13 


36 


205.5 




14 


36.8 


239.4 




16 


38.4 


318.2 



[0076] Table 4. Comparison in MROPS between BLE-JD (approximate 
Choleslcy decomposition) and low complexity data detection at twice the chip rate 



sampling. 



Spreading Factor, Q 


# of bursts, 
K 


Proposed algorithm 


BLE-JD 


1 


1 


52.6 


427.6 


16 


8 


58.1 


124.8 




12 


61.2 


248.3 




13 


61.9 


287.7 




14 


62.7 


330.4 




16 


64.3 


427.6 
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[0077] Table 5. Complexity of FFT of die channel correlation matrix as a 
percentage of the complexity of approximate Cholesky based BLE-JD. Approximate 
Cholesky based BLE-JD is set at 100% complexity. 



Spreading Factor, Q 


# of bursts, K 


Chip rate sampling 


Twice the chip 
rate sampling 


1 


1 


8% 


12% 


16 


8 


39% 


47% 




12 


20% 


25% 




13 


18% 


22% 




14 


15% 


19% 




16 


12% 


15% 



[0078] Figures 5-15 are graphs of the performance of low complexity data 
detection. Two high date rate services are simulated. One is single-code transmission 
with SF = 1 and the other is multi-code transmission with twelve codes and spreading 
factor 1 6 for each. Low complexity data detection is tested under various delay spread 
types including 3GPP working group four (WG4) defined delay spread channel cases 
1, 2 and 3. The simulations are set for both chip rate and twice the chip rate sampling. 
The length of delay spread is assumed W- 57. Zero timing error is assumed through 
the whole simulations. The channel impulse response is assumed to be exactly known. 
In general, the bit error rate (BER) performance of the multi-code case is better than 
its corresponding single-code counterpart in the simulation. For the particular 
example used in the simulation, single-code transmission uses 16 resource units per 
time slot while the multi-code transmission uses only 12 resource units in each time 
slot. Using only 12 codes produces less interference and therefore better BER. As 
compared with BLE-JD, only little or limited performance degradation are observed 
for proposed algorithm based on FFT decomposition of the channel correlation mati ix 
(FFf-R) in both single-code and multi-code cases. In single-code case, the FFT-R 
based approach is identical to the block linear equalization structure. The proposed 
FFT-R based approach and the approach based on FFT of the channel response matrix 
(FFr-H) are identical to each other at the chip rate sampling. 
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[0079] The performance of low complexity data detection using FFT-R and 
FPT-H is compared to an ideal single user bond, a worst case matched filtering, BLE- 
JD and single user detection with BLE using an approximate Cholesky decomposition. 
For the working points of interest, the BER range was typically between 1 % and 1 0%. 
Only a little or limited signal to noise ratio (SNR) performance degradations are 
observed for low complexity data detection as compared with BLE- JD, and significant 
SNR performance enhancement over matched filtering (MF). Low complexity data 
detection also performs well in an additive white gaussian noise (AWGN) channel 
environment. Figures 5-15 show that low complexity data detection offers very 
comparable performance in BER or SNR at much lower complexity and power 
consumption as compared to BLE-JD using approximate Cholesky decomposition. 

* * * 



WO 02/063779 



PCT/US02/02400 



-24- 
CLAIMS 

What is claimed is: 

1 . A method for use in receiving a plurality of data signals transmitted over 
a shared spectrum in a time slot in a time division duplex communication system using 
code division multiple access, each data signal experiencing a similar channel 
response, the method comprising: 

receiving a combined signal over the shared spectrum in the time slot, the 
combined signal comprising the plurality of data signals; 

sampling the combined signal at a multiple of a chip rate of the combined 

signal; 

estimating the similar channel response; 

constructing a channel response matrix based on in part the estimated channel 
response; 

determining a spread data vector based on in part a fast fourier transform (FFT) 
decomposition of a circulant version of the channel response matrix; and 

despreading the spread data vector to recover data from the received combined 

signal 

2. The method of claim 1 wherein the multiple of chip rate is twice the chip 

rate. 

3. The method of claim 1 wherein the determining is performed using a 
zero forcing algorithm. 

4. The method of claim 1 wherein the determining is performed using a 
minimum mean square error algorithm. 

5 . The method of claim 1 for use in downlink multiuser detection wherein 
the despreading is performed using all codes used in the time slot. 
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6. The method of claim 1 for use in single user detection wherein the 
despreading is performed using codes associated with a single user in the time slot. 

7 » The method of claim 6 wherein the single user detection is uplink single 
user detection and the single user is an only user transmitting in the time slot. 

8. A method for use in receiving a plurality of data signals transmitted over 
a shared spectrum in a time slot in a time division duplex communication system using 
code division multiple access, each data signal experiencing a similar channel 
response, die method comprising: 

receiving a combined signal over the shared spectrum in the time slot, the 
combined signal comprising the plurality of data signals; 
estimating the similar channel response; 

constructing a channel correlation matrix based on in part the estimated channel 
response; 

determining a spread data vector based on in part a fast fourier transform (FFT) 
decomposition of a circulant version of the channel correlation matrix; and 

despreading the spread data vector to recover data from the received combined 

signal. 

9. The method of claim 8 wherein the combined signal is sampled at a 
multiple of a chip rate of the combined signal prior to the steps of estimating and 
constructing. 

10. The method of claim 9 wherein the multiple of the chip rate is twice the 
chip rate. 

1 1 . The method of claim 8 wherein the combined signal is sampled at a chip 
rate of the combined signal prior to the steps of estimating and constructing. 
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12. The method of claim 8 wherein the FFT decomposition is performed 
using a permuted first row of the channel correlation matrix. 

13. The method of claim 8 wherein the FFT decomposition is performed 
using a defining row of the channel correlation matrix. 

14. The method of claim 8 wherein the determining is performed using a 
zero forcing algorithm. 

15. The method of claim 8 wherein the determining is performed using a 
minimum mean square error algorithm. 

1 6. The method of claim 8 for use in downlink multiuser detection wherein 
the despreading is performed using all codes used in the time slot. 

17. The method of claim 8 for use in single user detection wherein the 
despreading is performed using codes associated with a single user in the time slot. 

18. The method of claim 17 wherein the single user detection is uplink 
single user detection and the single user is an only user transmitting in the time slot. 

19. A receiver for use in a time division duplex communication system using 
code division multiple access, the system communicating using a plurality of data 
signals in a time slot, each data signal experiencing a similar channel response, the 
receiver comprising: 

an antenna for receiving radio frequency signals including the plurality of data 
signals; 

a demodulator for demodulating radio frequency signals to produce a baseband 

signal; 
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a channel estimation device for estimating the similar channel response at a 
multiple of a chip rate of the combined signal; and 

a data detector device for constructing a channel response matrix representing 
a channel of the data signals based on in part the estimated channel response, 
determining a spread data vector based on in part a fast fourier transform (EFT) 
decomposition of a circulant version of (lie channel response matrix, and despreading 
the spread data vector to recover data from the received combined signal. 

20. The receiver of claim 19 wherein the multiple of the chip rate is twice 
the chip rate. 

21 . A receiver for use in a time division duplex communication system using 
code division multiple access, the system communicating using a plurality of data 
signals in a time slot, each data signal experiencing a similar channel response, the 
receiver comprising: 

an antenna for receiving radio frequency signals including the plurality of data 
signals; 

a demodulator for demodulating radio frequency signals to produce a baseband 

signal; 

a channel estimation device for estimating the similar channel response; and 
a data detector device for consulting a channel correlation matrix representing 
a channel of the data signals based on in part the estimated channel response, 
determining a spread data vector based on in part a fast fourier transform (FFT) 
decomposition of a circulant version of the channel correlation matrix, and 
despreading the spread data vector to recover data from the received combined signal. 

22. The receiver of claim 21 wherein the combined signal is sampled at a 
multiple of a chip rate of the combined signal and the sampled combined signal is 
input into the channel estimation and data detector device. 
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23. The receiver of claim 22 wherein the multiple of the chip rate is twice 
the chip rate, 

24. The receiver of claim 21 wherein the combined signal is sampled at a 
chip rate of the combined signal and the sampled combined signal is input into the 
channel estimation and data detection device. 

25 . The receiver of claim 21 wherein the FFT decomposition is performed 
using a permuted first row of the channel correlation matrix. 

26. The receiver of claim 21 wherein the FFT decomposition is performed 
using a defining row of the channel correlation matrix. 
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